Characterizing Probability-based Uniform Sampling for Surrogate Modeling

نویسندگان

  • Junqiang Zhang
  • Souma Chowdhury
  • Achille Messac
چکیده

The fidelity of surrogate models remains one of the primary concerns in their application to represent complex system behavior. Appropriate sampling of training points is one of the primary factors affecting the fidelity of surrogate models. This paper investigates the relative advantage of probability-based uniform sampling over distance-based uniform sampling, in training surrogate models whose system inputs follow a distribution. The inputs to a surrogate model can be variables with known, assumed, or predefined probability of occurrence. Conventional distance-based sampling involves metrics defined in terms of coordinate-distances in the actual variable space. This paper instead uses the probability of the variables as the metric for sampling. This sampling approach inversely transforms a designated sequence of probabilities to their corresponding values using inverse Cumulative Distribution Function (CDF). Probability-based sample points thus determined are used as training points to develop the surrogate model. If the sequence of probabilities is uniform, the obtained probability-based sampling points are uniform in terms of probability. To obtain the same number of distance-based uniform sample points, the same sequence is directly scaled into the coordinates of training points. To study the suitability of probability-based uniform sampling for surrogate modeling, Mean Squared Error (MSE) of a monomial form is formulated based on the relationship between the squared error of a surrogate model and the volume or hypervolume per sample point. Using varying size of the uniformly-distributed training data set, the squared errors of the surrogate models are fitted as a monomial function of the hypervolume per sample point. The fidelities of the two surrogate models developed using probability-based and distance-based uniform sampling are compared using the monomial MSE function. The exponent of the monomial MSE function indicates which of the two surrogate models provides higher fidelity. When the exponent of the monomial function is between 0 and 1, the fidelity of the surrogate model trained using probability-based uniform sampling is higher than that of the surrogate models trained using distance-based uniform sampling. When the value of the exponent is greater than 1, the fidelity comparison is reversed. This theoretical conclusion is successfully verified using standard test functions. In this paper, the probability-based uniform sampling is also applied to the development of surrogate models for the thermal performance evaluation of windows. The conditional advantage of probability-based uniform sampling (formulated in this paper) also holds true for the window problem. [1] F. P. Miller, A. F. Vandome, and M. B. John. Inverse Transform Sampling. VDM Publishing, Saarbrcken, Germany, 2010. [2] E. L. Lehmann and G. Casella. Theory of Point Estimation. Springer, New York, 2nd edition, 1998. [3] J Zhang. Characterizing Probability-based Sampling for High-fidelity Surrogate Modeling. PhD dissertation, Rensselaer Polytechnic Institute, 2012.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Modified Bayesian Kriging for noisy response problems and Bayesian confidence-based reliability-based design optimization

The objective of this study is to develop a new modified Bayesian Kriging (MBKG) surrogate modeling method that can be used to carry out confidence-based reliability-based design optimization (RBDO) for problems in which simulation analyses are inherently noisy and standard Kriging approaches fail. The formulation of the MBKG surrogate modeling method is presented, and the full conditional dist...

متن کامل

Modified Bayesian Kriging for Noisy Response Problems for Reliability Analysis

This paper develops a new modified Bayesian Kriging (MBKG) surrogate modeling method for problems in which simulation analyses are inherently noisy and thus standard Kriging approaches fail to properly represent the responses. The purpose is to develop a method that can be used to carry out reliability analysis to predict probability of failure. The formulation of the MBKG surrogate modeling me...

متن کامل

Reliability-based design optimization using surrogate model with assessment of confidence level

The objective of this study is to develop an accurate surrogate modeling method for construction of the surrogate model to represent the performance measures of the compute-intensive simulation model in reliability-based design optimization (RBDO). In addition, an assessment method for the confidence level of the surrogate model and a conservative surrogate model to account the uncertainty of t...

متن کامل

Evaluation of estimation methods for parameters of the probability functions in tree diameter distribution modeling

One of the most commonly used statistical models for characterizing the variations of tree diameter at breast height is Weibull distribution. The usual approach for estimating parameters of a statistical model is the maximum likelihood estimation (likelihood method). Usually, this works based on iterative algorithms such as Newton-Raphson. However, the efficiency of the likelihood method is not...

متن کامل

Surrogate Modeling for Uncertainty Assessment with Application to Aviation Environmental System Models

Numerical simulation models to support decision-making and policy-making processes are often complex, involving many disciplines, many inputs, and long computation times. Inputs to such models are inherently uncertain, leading to uncertainty in model outputs. Characterizing, propagating, and analyzing this uncertainty is critical both to model development and to the effective application of mod...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013